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Abstract: Combinatory Categorial Grammar (CCG) is a radically lexicalized theory of 
grammar in which all language-specific information, including the linear order of 
heads, arguments, and adjuncts, is specified in the lexicon, from which it is projected 
onto sentences by language-independent universal type-dependent combinatory 
rules of low “slightly non-context-free” expressive power, applying to strictly adja- 
cent phonologically-realised categories. Syntactic and phonological derivation are 
isomorphic, and are synchronously coupled with semantic composition in a purely 
type-dependent rule-to-rule relation. 


1 Overview 


1.1 Goals 


The central problem addressed by Combinatory Categorial Grammar (CCG) is the na- 
ture of the mapping between sound and meaning. The goal is to achieve an explana- 
tory theory of natural language grammar that is immediately applicable to psycholog- 
ical and computational models of syntactic and semantic processing of spoken and 
written natural language, and of language acquisition by children. 


1.2 Data 


The data which are drawn upon in order to define CCG are facts generally agreed 
among linguists concerning long-range dependency, coordination, and prosodic 
structure, all of which give the appearance of displacement, or non-contiguity of 
elements that belong together semantically, such as governors (heads) and their com- 
plements. 

The data to which the theory has been applied is much more various, and includes 
corpus data, both labeled and unlabeled, that is used to train parsers, and the various 
test-sets that are used to evaluate them, including corpora of child-directed utterance, 
and psycholinguistic data. 
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1.3 Tools 


Crosslinguistic similarities and differences are represented in CCG solely at the level of 
the lexicon, which specifies alllanguage-specific properties including the linear order 
and semantic dependency of governors and dependent or complement constituents. 
The lexicon is projected onto the sentences of the language by “combinatory” rules— 
that is, by strictly string-adjacent operations, combining contiguous categories with- 
outthe involvement of any form of “action at a distance”, such as movement, copying, 
or deletion under identity. 

The only representational levels in CCG are phonological and logical form. Syn- 
tactic derivation is not itself a level of representation, and is dispensible. All syntactic 
rules are type-dependent, rather than structure-dependent, and assemble logical and 
phonological form in lockstep vvith syntactic derivation. The hypothesis is that the de- 
grees of freedom in the type-system of the lexicon and the combinatory rules are both 
necessary and sufficient for the analysis of the languages of the vvorld. The categories 
are those of categorial grammar. The relations between categories are combinatory in 
the sense defined above, and are fully formalized. 

The theory outlined in this chapter, and developed in slightly different forms and 
at greater length in earlier publications, has been applied to the syntactic and seman- 
tic analysis of coordination and unbounded dependeney in a vide range of languages. 
Ithas also been widely applied computationally in practical natural language process- 
ing (NLP) applications, particularly those requiring that the syntax support semantic 
interpretation. There is a CCG-based computational account of acquisition and de- 
velopment, based on semantic bootstrapping of the language-specific lexicon (Abend 
et al., 2017). There is a hypothesis concerning the origins of the categories and combi- 
natory rules in terms of their use for planning complex actions in human and prehu- 
man cognition (Steedman, 2002, 2017). Neither is discussed at any length here. Wide 
coverage parsers for CCG have been developed. 


1.4 Sample analysis 


The following sentence, selected by the editors for comparison across the various ap- 
proaches in this volume, is quite long: 


(1) After Mary introduced herself to the audience, she turned to a man that she 
had met before. 


1 The interested reader can try out the “Easy CCG” parser (Lewis et al., 2016) by typing or pasting 
sentences such as (1) into the input box at http://4.easy-ccg.appspot.com/, bearing in mind that this 
is a probabilistic parser, with a lexicon and parsing model primarily trained on the Penn VVS) tree- 
bank. 


14 Combinatory Categorial Grammar — 391 


Accordingly, its derivation is presented in Figure 1in three steps, with all discussion of 
semantics and logical form deferred until the detailed discussion of the constructions 
involved. 

First, the preposed adjunct After Mary introduced herself to the audience is derived 
syntactically as in Figure 1a. CCG derivations like this are written in the acceptance di- 
rection, with the words at the top and the “start symbol” (usually, S) at the bottom, 
but are otherwise equivalent to standard derivational phrase-structure trees. Slashes 
/ and) define the English transitive verb as looking for its first NP (object) argument 
to the right and its second (subject) argument to the left. Underlines indicate combi- 
nation, and the directional arrows > and < indicate that the rule involved is forward 
(rightvvard) or backvvard (leftvvard) application. The 1 notation indicates that the cat- 
egory in question such as NPİ has a type-raised or cased category such as the nomi- 
native category S/(SVNP), abbreviated here for readability. Since the derivation shown 
is entirely applicative, type-raising has no effect here other than to reverse the direc- 
tionality of the rule that combines verb and argument, so can be temporarily ignored. 
The binding of the reflexive anaphor “herself” is also lexicalized, via the logical form 
(not shown), whose details are discussed in Section 3.5. 

The main clauseinvolvesanunbounded relativized dependency,and is more com- 
plicated syntactically, making crucial use of composition and type-raising, as in Fig- 
ure 1b. 

This derivation crucially involves composition rules, indexed >B and >B... Their 
operation, whose details are discussed in Section 4, crucially depends on the argu- 
ments being type-raised. In particular, the subject “she” of the relative clause must 
bear the nominative raised category for the derivation to go through, although for 
the purposes of this overview, we continue to abbreviate it as NP. (Thus, English is 
highly ambiguous as to case, unlike morphologically cased languages like Latin and 
Japanese.) 

To complete the derivation, the sentential adjunct derived in 1a combines with 
the sentence derived in 1b by simple forward application, yielding a sentence, as in 
Figure 1c. 

Although the assembly of logical form is not shown in this introductory anal- 
ysis, its derivation is entirely compositional and homomorphic to the surface syn- 
tactic derivations shown. In particular, the logical form corresponding to the com- 
plex noun “man that she had met before” is under the analysis of relativization 
developed in Section 4 itself a property of type N with the appropriate logical form 
AnAx.past (perfect (meet x pro,p,)) A nx (cf. (25)). Such details are discussed at length 
in the body of the chapter. 

Anaphoric relations, including the binding of the pronoun “she” in the main 
clause to “Mary” in the adjunct, and the binding of the past tense of the main clause 
“turned” in an “after” relation to the antecedent reference time of the adjunct “intro- 
duced”, among others, are not treated in CCG as falling in the domain of sentence 
grammar proper. 
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1.5 Evaluation criteria 


The evaluation criteria for comparing CCG with alternative approaches are descrip- 
tive and explanatory adequacy, and applicability to practical computational natural 
language processing, including the building of logical form. 

Descriptive adequacy is attained by capturing all the phenomena of a system. Ex- 
planatory adequacy is attained by capturing only those phenomena, and being unable 
to capture other comparable phenomena that are not exhibited by the system. A the- 
ory which can express phenomena that vve believe can never occur is overly-expressive 
and less than explanatory (although such theories may be extremely useful in laying 
out the phenomena in ways that help us to find our way to more explanatory ones).” 

Since in the case of the grammatical system vve only have a fairly small sample 
of languages to work with, we don't have complete knowledge of the set of possible 
phenomena. It follows that any claim to explanatory adequacy in the theory of gram- 
mar is a hostage to fortune, and can be disproved by the discovery of languages that 
controvert its prediction of their non-existence. 

Nevertheless, the available descriptive accounts make grammar seem relatively 
systematic. For example, we shall see in Section 7 that CCG predicts that two of the 
24 possible ways of linearising the four elements corresponding to the English words 
comprising the noun-phrase “these five fat cats” are impossible, and will never be 
found in any natural language (cf. Greenberg, 1963, Cinque, 2005). A theory that is 
descriptively adequate in other respects, and also accurately predicts the same gener- 
alization concerning word orders in other constructions is empirically falsiftable, and 
therefore more explanatorily adequate than one that does not.” 


2 Historical background to CCG 


VVhen syntactic theory as defined in Chomsky, 1965 (hereafter, Aspects) fragmented 
in the '70s and ”80s, leading to the profusion of approaches assembled in the present 
volume, attempts to develop alternatives to Aspects-style transformational rules took 
two forms. 


2 While initially acknovvledging something like the above as a definition of explanatory adequacy, 
Chomsky (1965) proposes the provision of a theory of child language acquisition as a proxy for ex- 
planatory adequacy. Hovvever, once vve have admitted that some amount of innate knovvledge mustbe 
available for language acquisition to be possible at all, then every theory of language has a theory of 
acquisition if we assume that its key assumptions and constraints are by some evolutionary process 
innate. In order to avoid merely pushing the burden of explanation off onto the theory of evolution, 
an explanatory theory in the first sense is actually a prerequisite for an explanatory theory of child 
language acquisition. 

3 For reasons that have nothing to do with the theory of grammar, all such variation is Zipflan in 
distribution, vvith the long tail of less common orders becoming double-exponentially rarer, so that it 
isin practice hard to know whether an unseen word order is truly impossible, or just so rare that we 
haven't seen it yet. 
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One group of constraint-based theories, usually expressed in unification-based 
formalisms, were as overly expressive as Aspects transformational grammars, but 
were easier to implement computationally (and therefore to automatically check for 
over- and under-generalization). They included Augmented Transition Network Gram- 
mar (ATNG. Woods, 1970), Functional Unification Grammar (FUG, Kay, 1984), Lexical 
Functional Grammar (LFG, Bresnan, 1982), Dependency/VVord Grammar (DG/WG, 
Hays, 1964, Hudson, 1984), Head-driven Phrase Structure Grammar (HPSG, Pollard 
and Sag, 1987), Autolexical Grammar (ALG, Sadock, 1991), Role and Reference Gram- 
mar (R&RG, Van Valin, 1993), Sign-based Construction Grammar (SBCxG, Boas and 
Sag, 2012), Simpler Syntax (SS, yackendoff, 1997, 2002, Culicover and yackendofi, 
2005), Type-Logical Grammar (TLG, Moortgat, 1988, Hepple, 1990, Dovvty, 1993, Mor- 
till, 1994, Tacobson, 1999), and some versions of Montague Grammar (MG, Montague, 
1973, Bach, 1976, Cooper, 1983). 

A second group sought for formalisms that vvere much less expressive in the first 
place, such as Generalized Phrase Structure Grammar (GPSG, Gazdar, 1981), Combi- 
natory Categorial Grammar (CCG, Ades and Steedman, 1982, Szabolesi, 1989), Head 
Grammar (HG, Pollard, 1984), and Tree Adioining Grammar (TAG Joshi, 1988). 

m the same period, the transformational theory itself evolved through the Ex- 
tended Standard Theory (EST, Chomsky, 1972), the Revised Extended Standard The- 
ory (REST, Chomsky and Lasnik, 1977), and Principles and Parameters/Government- 
Binding (PS:P/GB, Chomsky, 1981), until the more radical reform of the current Mini- 
malist Program (MP, Chomsky, 1995b,a, 2000), defined by the assumption thatsyntac- 
tic derivation is determined by its function of creating objects that are phonologically 
and semantically vvell-formed, an assumption that (at least in aspiration) makes it 
more akin to the latter approaches, and in particular, the present approach of Combi- 
natory Categorial Grammar. 

m particular, both Minimalism and CCG are committed to the view that syntac- 
tic derivation works by language-independent principles, from which it follows that 
all language-specific properties of constructions must derive from the lexicon of the 
language, not via language-specific rules or constraints. 

m that sense, as Adger (2013) has pointed out, Chomskian Minimalism can be 
seen as a form of Categorial Grammar that adds Movement as the mechanism for han- 
dling discontinuous dependency, rather than the combinatory rules of CCG that are 
defined below. One of the purposes of the present chapter is to compare and contrast 
the movement theory of discontinuity with the alternative combinatory extension of 
Categorial Grammar proposed by CCG, as vvell as vvith the other alternative theories 
noted above. 

m order to make these comparisons, the presentation of the formal specifics of 
CCG viill be tied to the constructions in English and other languages that motivate their 
introduction. Our rules are empirically motivated, and it is only in the latter sections 
of the chapter that we turn to the question of why they take the form that they do. 
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The constructions in question fall into tvvo groups. The first group comprises the 
“bounded” constructions like raising, control, passive, unaccusative, intransitiviza- 
tion, the binding of reflexive pronouns, etc., which concern relations or alternations 
in relations among the arguments of a single head, such as a verb, together vvith re- 
lated matters like agreement and case. These are phenomena on which all theories 
more or less agree, differing only in the degree of lexicalization vs. syntacticalization 
that they assume, with CCG occupying the radically lexicalized end of the spectrum. 
These are dealt with fairly briefly in Section 3.” 

The remaining sections of the paper concern a much more problematic range of 
constructions which we vvill 1oosely refer to as “unbounded”, which constitute a much 
more difficult problem for the theory of grammar, and for which CCG presents a rad- 
ically different analysis from other theories. They include relativization and its allies 
such as topicalization and v/h-question formation, together with its subspecies such 
as “pied-piping” and “parasitic” extractions, all of which have been attributed to un- 
bounded movement, and are the subject of Section 4, which introduces all the remain- 
ing syntactic operations, and shows that the v/h-constructions can be analysed with- 
out movement. Section 5 then goes on to show that various forms of coordination re- 
duction, which have elsewhere been attributed to deletion under identity, copying, or 
parallelism, can be eliminated under exactly the same assumptions as movement. 

Section 6 then briefly reviews the notion of constituency that is implicit in CCG, 
and notes that English intonation structure and its semantics reflect exactly the same 
notion of derivational constituency as the earlier constructions, without the stipula- 
tion of extra-syntactic features such as “edges” and non-syntactic processes such as 
“Focus Profection”. 

Section 7 is more technical, and addresses the question of the degrees of freedom 
that have been exercised in achieving this account, and the explanatory adequacy of 
the theory that results. A brief conclusion then sums up. 


3 Pure Categorial Grammar (CG) 


The pure Combinatory Grammar of Ağdukievvicz and Bar-Hillel eschews language- 
specific syntactic production rules like (2) for English. 


Q) s — NP VP 
VP — NP 
TV Íproxed, found, sees, ...) 


4 Many of the constructions of central concern to Construction Grammarians arguably belong in this 
class of lexically-governed constructions. 
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Instead, the same language-specific syntactic information is lexicalized, via lexi- 
cal entries like (3) for the English transitive verb: 


(3) sees :=(SNVNP,;;)/NP 


This “category” identifies the transitive verb as a function or governing category, spec- 
ifying the type, directionality, and agreement of its NP arguments and the type of its 
result, S. Thus, it specifies “sees” as a transitive verb vvanting an NP on the right (with 
unspecified agreement) as its first argument, to yield a function vvanting an NP bear- 
ing third singular agreement on the left, to yield S.” 

The lexical notation for Chomskean Minimalism is essentially categorial (Chom- 
sky, 1995a, 2000, Stabler, 2011, Adger, 2013): 


(4) sees :: (-D-case, =D V] (“yields V, selects tvvo D (NPs), 
assigns case to the first”) 


(The above is Chomsky”s notation, which omits directional alignment, like a catego- 
rial grammar with non-directional slashes | X, which loses CCG”s transparency to 
language-specific linear order of governors and arguments. Stabler also discusses a 
Directional Minimalist Grammar (DMG) with =X and X= directionality, parallel to /X 
and W.) 


3.1 The categorial lexicon 


When stated in full, categories also specify a semantics or logical form, as in (5a), as 
anatomized in (5b), in which the separator “:=” pairs a phonological/graphological 
form vvith a category, and the separator “:” pairs a syntactic type vvith a logical form: 


(5) a. sees := (SVNP5,)/NP : AxAy.sees x y 


category 
phonological form syntactic type logical form 
A —— ————- 
b. sees = (S a MWPsə/NP : Ax). Seesxy 
feature A-binders predicate-argument 


structure 


The predicate-argument structure component of logical form is assumed to be cross- 
linguistically universal, although elements like sees are of course a proxy for more 
complex structures involving tense, aspect, etc. Predicate-argument structure is there- 
fore essentially equivalent to the lexical component of thematic structure in Minimal- 
ism, f-structure in LFG, ARG-ST in HPSG, the grammatical function tier of SS, and 
dependency structures in DG/WG, although unlike some of these formalisms, CCG 


5 3s agreement is of course specified by -s morpho-phonology by a process discussed later in this 
section, and in Section 7. VVe assume a standard mechanism of simple non-recursive feature-value 
unification. 
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does not include explicit röle-labels, or define these structures as linearly ordered or 
aligned. 

The use of an explicit representational level of linguistic form, distinct from syn- 
tactic derivation, is a point of difference from the TLG tradition of Lambek in Catego- 
rial Grammar. While TLG often presents interpretations as A-terms, they are included 
purely for ease of reading, and are proclaimed to be dispensible. The semantics itself 
is defined by “direct surface composition” on the syntactic derivation itself (facobson, 
1999—see Bozşahin, 2012, 87-106 for discussion). 

VVhile direct surface compositionality is technically possible for CCG, there is a 
good reason to include a representational level of logical form. Since the only plausi- 
ble account of child language acquisition is that it is semantically bootstrapped from 
a prior representation of meaning, and since that meaning representation must be 
independent of the surface syntax of any specific language, it must be a language- 
independent logical form. It is therefore syntactic derivation, rather than LF, that is 
dispensible as a level of representation. 

To take a slightly more complex lexical verb, the following is the category for a 
subject control verb for a sentence such as He promises her to leave, again anatomized 
as (b): 


(6) a. promises := ((SNNPs,)/VP,,)/NP : AxApAy.promises (p y) xy 


category 
phonological form syntactic type logical form 
——.—. === —— 
b. promises m ((ŠS fn WP,s;)/VP,o)/NP : Ax Ap Ay. promises (py)xy 
feature A-binders predicate-argument 


structure 


The control relation betvveen the subiect and the infinitival VP complement is en- 
tirely captured at the level of the logical form, via the variables y and p. Here, the 
minimalist notation would be slightly different, treating the infınitival complement as 
a “small” clause, and treating the relation to the surface syntactic subject as mediated 
by movement or an anaphoric element. To that extent, CCG can be seen as lexicaliz- 
ing the A-movement analysis statically, via the use of bound variables at the level of 
logical form. Because such lexicalization is by definition limited to relations betvveen 
co-arguments of the verb, via the logical form, it necessarily obeys minimality con- 
ditions variously identifted as “sub/acency”, “relativized minimality”, “the Minimal 
Link Condition”, etc., as do GPSG/HPSG/LFG also. 

In CCG, all bounded constructions, such as passive, reflexivization, raising, and 
control, are lexically governed in a similar vvay, so that all instances of so-called A- 
movement are specified statically in the lexicon via the logical form of the governor.” 


6 It follows that the phenomenon of so-called Backvvard Control, in which an explicit subject in an 
infinitival complement clause appears to bind an implicit argument in the matrix clause, as has been 
proposed for Tsez by Polinsky and Potsdam (2000), cannot be handled as control in CCG. 
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As in any theory of grammar, rather than listing every single lexical category in its 
own right, we may vvant to capture “parametric” generalizations across the lexicon for 
any given language via “lexical redundancy rules” Jackendoff (1975), such as that not 
only “sees”, but every transitive tensed main verb has an SVO syntactic type, or that 
some identifiable class of the same transitive verbs including “eat” but not “find” map 
systematically onto a corresponding class of intransitivized verbs. Some, like intransi- 
tivization itself, vvill be morphologically unmarked, others like passivization, marked. 
Such regularities will allow the language learner to infer the existence of other mem- 
bers of such paradigms when they first encounter a novel verb, and possibly to learn 
more rapidiy. They also have the advantage of allowing the grammar to be represented 
more compactiy, although the fact that all of these generalizations are liable to admit 
of exceptions or irregularities shows that this is not the only consideration, and it may 
vvell be the case that such paradigms are compiled out into multiple lexical entries in 
their own right, as soon as encountered. All such solutions are formally equivalent, 
and the present chapter remains agnostic as to which should be preferred. 

The present chapter maintains an open mind about exactly how those lexical gen- 
eralizations should be captured. Give or take the notational idiosyncrasies applying 
to the lexicon, and the use in addition in some of the other theories discussed in this 
volume of language-specific syntactic rules, all theories are pretty much equivalentin 
this respect, and can be applied to the categorial lexicon. Accordingly, we will pass 
them over in this chapter, to concentrate on the syntactic component, which is more 
distinctive to CCG. 


3.2 Syntactic rules l: Pure application 


Verb categories like (5) and (6) combine most simply by the following rules, which are 
universal and language-independent: 


(7) a. Forward Application: 


XiY Y > X O) 
b. Backward Application: 
Y XY > X (o 


X and Y can be any syntactic CCG type, and may include simple feature-value pairs, 
such as agreement. > reads as “the things on the left combine to yield the thing on 
the right”, and is entirely analogous to the reverse of the revvrite arrow of the PS rules 


Cormack and Smith (2002) show that the construction in Tsez is limited to just two verbs, meaning 
“begin” and “continue”, and that the supposedly controlling infinitival complement subject cannot 
be referential, as with a universally quantifted NP. Both restrictions are reminiscent of there insertion 
in English, which is limited to raising verbs and non-referential subjects, so it seems reasonable to 
assume that so-called backvvard control arises from some form of lexicalized equivalent of expletive 
insertion at the level of lexical logical form. 
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in (2). s and c are annotations in derivations indicating the application of the rule 
in question. The slash-type x on the functor categories in (8) means that any functor 
category of the form X/Y or XAY can combine by these rules. 

Such rules, like all rules in CCG, correspond to the Minimalist operation of (exter- 
nal) merge, with the effect of “canceling” Y term, as if they were rules of fractional 
multiplication. They are analogous to Minimalist “feature-checking” between argu- 
ment and governor or head (Adger, 2003, 90-96). 

Such rules are rules of semantic merger, as well as syntactic. Thus we can extend 
them asfollovvs, with “:” again acting asan uninterpreted separator between syntactic 
type and semantic interpretation or logical form: 


(8) a. Forward Application: 


XIY:f Y:a > X:fa O) 
b. Backward Application: 
Y:a XUY:f => X:fa (Q 


Suchrules are both syntactically and semanticallyrules of functional application, 
or combination of functions with their arguments, as in the following derivation, in 
which syntactic derivation and the composition of logical form are synchronous and 
homomorphic. 


(9) Harry sees Sally 
NP, (SNNP,;Ə/NP NP 
: harry : AxÀy.sees x y : sally 


SVNP,, 
: Ay.sees sally y 
< 


S 
: sees sally harry 


(The absence of any explicit slash type on the categories in the derivation (9) means 
that those categories can combine by any rule, including some discussed below that 
are more restricted than the application rules.) 

Since categories like (6) achieve the effect of “A-movement” via A-binding at the 
level of logical form, the application rules are all that is needed to capture the phe- 
nomenon of control, as in the following derivation: 


(10) Harry promises Sally to leave 
NPsun ((SNNPs,)/ VP, )/NP NPs., VP, f VP VP 
: harry : AxApAy promises (py)xy :sally :Ap.p :Ay.leavey 
> > 


SAP VP. 
: ApAy.promises (py) sallyy : Ay.leavey 


SANP : Ay.promises (leave y) sally y 


S : promises (leave harry)sally harry 


Such purely applicative derivations will correctly form “chains” of raising and 
control relations in examples like the following, which are left as exercises: 
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(11) a. Harry promises Sally to persuade Alice to leave. 
b. Harry seems to promise Sally to leave. 
c. Harry wants to try to begin to write a play. 


3.3 The Combinatory Projection Principle 


The application rules in (8) constitute directionally specified forms of the simplest 
“external” form of the Chomskian Minimalist operation “Merge”. They conform to a 
simple generalization which governs all rules in CCG: 


(12) The Combinatory Projection Principle (CPP) 
Syntactic combinatory rules are binary, linearly-ordered, type-dependent rules, 
applying to string-adjacent categories, whose linear order is consistent with 
their directionaltypes, and project unchanged the type and directionality of any 
argument in the inputs that also appears in the result. 


This principle is defined more formally in Steedman, 2000 in terms of three more 
fundamental principles of Adjacency or Contiguity, Directional Inheritance, and Direc- 
tional Consistency, and forbids rules like the following, which combine forward func- 
tions backvvard (a), combine inner arguments before outer (b), or switch directionality 
between input and output (c): 


(13) a. Y:a X/Y:f s Xfa 
b. (X/Y)/Z:f Y:a ə X/Zifa 
c. (X/Y)/Z:f Za € XV:fa 


All bounded constructions—that is, those defining relations over the arguments 
ofa single head, such as raising, control, passive, unaccusatives, causatives, etc.—are 
defined morpholexically in CCG. Where there are systematic relations or alternations 
between subcategorizations by the same head, as in the door opened, the door was 
opened, Harry opened the door, the door opened itself, etc., these may be mediated 
by morphological markers, or by lexical rules, or by autonomous lexical entries, or 
by some mixture of the above, all of which may admit of phonological regularities 
and exceptions, as in many other theories mentioned above, such as LFG, HPSG, SS, 
etc. While such choices may be extremely important to efficient representation of the 
grammar for purposes of processing or acquisition, they are all formally equivalent, 
and vill not be distinguished here. 


3.A Case and morpholexical type-raising 


Case is assumed to be a universal primitive of grammar (cf. Vergnaud, 1977/2006). That 
is to say that all noun-phrases (NP) like “Harry” are (polymorphically) type-raised in 
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the morpholexicon. Type raising is so-called because it assigns to predicate-argument- 
structural arguments the category of a higher-order function over predicates that take 
NPs like “Harry” as an argument. For example, in place of (9), we have 


(14) Harry sees Sally 
S/(SNNPs;) (SVNPŞ,)/NP (SWNPss)V(SNNPss;)/NP) 
: App harry : AxÀy.sees x y : App sally 
SVWPy, i 
: Ay.sees sally y 
5 
: sees sally harry 


The effect of type-raising is to svvap the roles of function and argument between 
subject and predicate, so that the forward application rule (Sa) applies, rather than the 
backvvard rule (8b), and vice versa. crucialiy, the logical form that results is identical 
to that in (9). 

Type-raising is the job of case morphemes like the nominative suffixes -ga in 
Japanese and -us in Latin, as in Figure 2a, in which the N double-slash indicates a 
morphemic function that can only appiy inside the lexicon. In contrast, English NPs 
are underspecifted as to case, as in Figure 2b and c (the latter involves first-person 
subject pro-drop, represented in the logical form as anaphorically bound one,,). 

Thus, even in English, type-raising is an operation of the lexical componentofthe 
grammar, not a syntactic rule.” 

From now on we will usually abbreviate English underspecifted type-raised NP 
etc. as NP! etc., with the meaning “whatever type-raised NP category is required for 
the derivation”. Determiners will accordingly be written as NPT/N. 

Type-raising (Case) makes arguments into function categories that are more like 
adjuncts or specifters than like complements (Adger, 2013). Adger uses such type- 
raising to avoid problematic “roll-up” derivations under a minimalist approach. This 
use seems parallel to the use in CCG of lexicalized type-raising to capture pied-piping 
relatives and in situ wh in examples like (32) below. 


3.5 Reflexive anaphora 


VVe assume for present purposes that reflexive pronouns are clitic, like French se. The 
boundedness of reflexivization then arises from the fact that cliticization is a mor- 
pholexical process, despite the fact that in both languages the clitic in question is 
vvritten as a separate vvord. 


7 Of course, the processor might choose for reasons of efficiency to leave case under-specifted, and 
apply type-raising dynamically in context, under the control of a parsing “oracle” such as a statistical 
parsing model. (This is in fact how all CCG parsers for English work.) However, that does not imply 
that it is a rule of syntax. 
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We have the following categories for clitic “himself”, in which the morphological 
slash N restricts its application to lexical verbs: 


(15) himself :- (SWWP,,,)W((SVNP,,,)/NP) : ApAy.p (self y) y 
(SVVPsy)/PPAXÇSVNP,,,)/PP)İNP) : ApAu)Ay.p vü (self y) y 
etc. 


Syntactically, these categories are accusative instances of type-raised cased NPİ. Hovv- 
ever, their semantics is not T.Š 

The derivation for a simple reflexive transitive clause is the following, where 
self harry evaluates to harry: 


(16) Harry sees himself. 
NPÍ (SVVP.)/NP (SVNPs;;,)N(SVNP,;,)/ NP) 
: App harry : AxÀy.sees x y : ApAy.p (self y) y 


<LEX 
SVNP3un : Ay.sees (self y) y 


S : sees (self harry) harry 


For reflexive ditransitives of the kind we saw in (1), we have the following: 


(17) Mary introduced herself to the audience. 
NPİ ((SVNP,)/PP.)INP ((SVNP,y)/PP)M((SVNP,,,)/PP)/NP) PP! 
: Ap.p mayy : ÀAxAwAy.introducedw xy : ApAV/Ay.p w (self y) y : Ap.p audience 
(SNNP,,y)/NP : AvvAy.introduced w (self y) y bikə 
< 
SNNP,;sy : Ay.introduced audience (self y) y 
> 


S : introduced audience (self mary) mary 


It seems reasonable to assume that Harry talks to himself is also a true se-type 
reflexive arising from lexicalization of “talks to”, as in the following derivation:? 


(18) Harry talks to himself. 
NPL (SVNPs,)/NP ((SWPss,))X((SVNP;;)/ NP) 
: App harry : AxÀy.talks (to x) y : ApAy-p (self y) y 


SVNB,;,, : Ay.talks (to (self y)) y 
S : talks (to (self harry)) harry 


Example (19a) can be analysed similarly to (17). However, the reflexives in (19b-e) 
cannot reasonabiy be analysed as clitic in the same way, and must be “exempt” or 
logophoric pronouns, of a kind to be discussed below: 


(19) a. Harry showed himself a movie. 
b. Harry showed a movie to himself. 


8 The analysis is similar to that of Szabolcsi (1989), which is also lexicalized. 
9 This possibility may be related to the cross-linguistically unusual possibility in English of 
“preposition-stranding” v/h-extraction. 
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c. Harry talks to and about himself. 
d. Harry talks to only himself. 
e. Harry sees and admires himself. 


The following further “subject reflexive” instance of the type-raised reflexive for 
the non-existent “*heself”, (20) is excluded for English because it is not a possible En- 
glish cased category, since (SUNP)/NPs;,, is not an English transitive verb category: 1 


(20) *heself:=(S/NP)//((SNNP)/NPss,,) : ApÀx.p x (self x) 


The CCGi identification ofa languages case-system with type-raising over its verbal 
categories therefore predicts the “anaphor agreement effect” of Rizzi (1990), rather 
than requiring it as a stipulative constraint, thereby capturing Condition A of Chomsky 
(1981). 

The presence in English of “logophoric” reflexives that are homophonous to the 
reflexive, but are non-clause bound, like pronouns, is a source of confusion. Such 
forms are exempt from the binding conditions, and refer to the individual whose vievv- 
point the text presents (fackendofi, 1972, Higgins, 1973, Zribi-Hertz, 1989, Pollard and 
Sag, 1992), as in: 


(21) a. The fact that there is a picture of himself; hanging in the post office is be- 
lieved by Mary to be disturbing Tom;. 
b. A fear of himself; is fohn” s greatest problem. 
c. John saw a picture of himself. 


We will assume following Pollard and Sag that cases attributed to “reconstruc- 
tion” like the following in fact arise from the involvement of exempt logophoric pro- 
nouns of this kind, rather than from true reflexives. 


(22) a. Which pictures of himself; did Harry; see? 
b. Alice wonders which pictures of himself; Harry; saw. 
c. Alice wonders who; saw which pictures of himself;. 


4 The unbounded wh-constructions 


Relativization and allunbounded dependencies crucially involve type-raising and the 
syntactic combinatory rules of function composition, considered next. 


10 (SNNP)/NPss,., is in fact the category ofan ergative transitive verb with absolutive agreement. Wool- 
ford (1999, 267-8) shows that ergative languages with absolutive agreement such as Inuit and Nez 
Perce disallow the equivalent of “Heç, sees himself ;,,”, which would require raising over the English 
transitive type, while ergative languages without absolutive agreement, such, as Enga, do allow them. 
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4.1 Syntactic rules Il: Composition 


We will need the following rules of composition, which constitute all and only those 
allowed by the CPP (12) for these (first-order) categorial types: 


(23) a. Forward Composition: 


Xİi,Y:f Y/Z:g => XİZ:Azf(gz) (>B ) 
b. Backward Composition: 

YAZ:g XA.Y:f = XWV:MXzJf(gz) (<B ) 
c. Forward Crossing Composition: 

X/Y:f YZ:g => XV :)X.J (gz) (>B.) 
d. Backward Crossing Composition: 

Y/Z:g XVY:f > Xİ/Z:Azf (gz) (<B.) 


Like the application rules (8), these rules have the effect of “canceling” Y, as if 
this vvere fractional multiplication. The types o and x on the slashes in these rules 
mean that only categories whose own slash type is compatible under a type-hierarchy 
of slash types (Baldridge, 2002) can combine by theserules. The simplifted convention 
used in the present paper is that only categories with o slashes or unrestricted slashes 
can combine by o rules, and o categories cannot combine by the crossing x rules. 
Similarly, categories with a x or unrestricted slash-type can combine by x rules, but x 
categories cannot combine by 6 rules." 

Therules in (23) obey the CPP (12), including the Principle of Adjacency or contigu- 
ity. The absence of slash-typing on the secondary function YIZ means that it can apply 
to any type, but the CPP requires that that type will be passed to the result XIZ. Thus, 
as with the application rules (8), in Minimalist terms, the composition rules (23) con- 
stitute additional cases of External Merge, except for allowing the equivalent of some 
“feature-checking” that is not allowed under Minimalist merger (Adger, 2003, 93-94). 
However, they thereby achieve the same result as Minimalist Internal Merge, or Move. 

To see this in the context of the relative clause, we will assume the following cat- 
egory for the English relative pronoun: 


(24) that := (NN .N)/,(S/NP) 


We can then derive a relativized noun modifter “that she had met” of type NN ¿N (from 
the introductory example (1) and Figure 1b, slight simplifted), as follovvs, using the 
first of the composition rules (23a) to form a constituent of type SNNP adjacent to the 
relative pronoun by adjacent merger of the elements of the residue of relativization. 


11 However, another less restrictive convention is possible, in which these two slash types are explic- 
itly confunctive, written o and x, while o and x types can only compose. 
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(25) (The man) that she had met 
(NX3N)/,(8/NP) S/(SWP) (SMNWP)/VPy;rcpl VP ə, NP 
: ApAnAx px A nx : Àp.ppro;x, : ApÀy.past (perfect (p y)) : AxAy.meet xy 
>B 


S/VP 
š Ap.past (perfect (p PTOşe)) 
>B 


S/NP 
: Ax.past (perfect(meet x pro.,,)) 
> 


NXN 
: AnAx.past (perfect(meet x pro,.)) An x 


&8.1.1 Unbounded relativization 
The same combinatory rule (23a) can apply recursively, to build a constituent of the 


same type S/NP by multiple adjacent mergers, to which the relative pronoun can apply 
as before to yield a noun modifter (semantics omitted): 


(26) (The man) that she says that she met 
(NV 2N)/,(5/NP) NP! (SNNP)/,S" S” /,S NPİ (SVNP)/NP 
— A: = — = 
S/.S' S/NP 
S'/NP 

>B 
S/NP 
NV.N ; 


4.1.2 Embedded subject extraction 


However, the o slash-types of the complement of think correctly forbids extraction of 
the subject of a that complement: 


(27) “mena that [she says that] s¿s [met herlsiyp 


This is nota stipulation that could be otherwise: if a language like English allowed says 
or says that to compose with sees her by crossed composition it would immediately 
allow such non-English orders as the following with the meaning “she says the men 
met her”: 


(28) “She the men says met her. 


Such considerations lead us to expect that a general prohibition against extrac- 
tion of complement subjects should be a specific characteristic of rigidly SVO lan- 
guages and constructions, in which the directionality of S and O arguments differ, but 
not of SOV and VSO, in which they are the same, a generalization that appears to be 
correct (Pesetsky, 2017). 
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It also follows that in order to overcome the general prohibition on subject extrac- 
tion out of bare complements for the class of English verbs that allow them, we need 
to give them an extra lexical category, like the following: 

(29) think, believe, ... := (SVNP)/NP, yazar /(SNNPazy) : ApAxAy.think(px)y 


This category combines vrith a tensed predicate to yield something requiring an NP 
marked as --V/H which can only reduce with a relativized category, as in the derivation 
of “lal man who 1 think likes Lester” in Figure 24.” 


4.1.3 Parasitic extraction 


For completeness, we note in passing that a further class ofrule including the one here 
indexed -S,, constituting a “duplicating” generalization of composition is needed to 
capture “parasitic gapping” cases of relativization, again building a constituent of 
type S/NP by a succession of adjacent mergers: 

(30) (The articles) that Harry refected without reading 

(NX QN)/..(S/NP) NPİ (SNNP)/NP ((SNNP)VSVNP))/VP VP/NP 
>B 
(SVNP)SNNP))/NP 


(SUNP)/NP sə 
>B 
S/NP 
NAN ı 


We pass over the details of the S rules here, referring to Szabolcsi (1983, 1989) and 
Steedman (1987, 1996), noting not only that they are constrained by the Combinatory 
Profection Principle (12), but also that they exploit all degrees of freedom allowed un- 
der that principle, which correctly allows parasitism to be supported by complement 
subject extraction, forcing the choice of rightward subcategorization for the extracting 
NP in the subiect-extracting category (29). 


(31) A man that Harry will [tell you is a crooklypyyp, ,,, [while pretending to 


+WH 
admire|vypvvp)/NP 


A.1.A Pied-piping extraction 


The phenomenon of “pied-piping” in relativizations like the following is captured by 
giving relative pronouns like which (but not that) the further category shown in the fol- 


12 The reason for the extracting NP being a rightvvard argument will become clear below, when we 
consider the “Across-the-Board” constraint on extraction under coordination. 
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lowing example, the details of whose derivation we pass over (Steedman, 1987; Mor- 
rill, 1994, Steedman, 2012): 


(32) [[Reports]y Illthe height of the lettering on the covers ofluyçs/yp))/NP 
İvvhichlyayy)/s/x))xcsvsiyp))/yp)) [the government prescribeslş xp 


4.2 Crossing dependencies 


The inclusion of crossing composition rules, together vvith the folloving generaliza- 
tion of the composition rules to “higher order” cases with second-order secondary 
functors of the form (Y/Z)/TV allows the set of possible non-terminal categories to 
grow unboundedly, showing CCG grammars to be trans-context free. 


(33) a. Forward 2nd-order Composition: 


X/.Y:f (Y/Z/V/ :g ə (XİZİ/VV : AvvAzif (g w z) (>B2 ) 
b. Backvvard 2nd-order Composition: 

(YUVAM :g XA.Y:f = (XV : Avezif (g wz) (<B2 ) 
c. Forvvard Crossing 2nd-order Composition: 

XLY :f (YZ)W:g > (XVVV : AwAz.f (g wz) (>B2) 
d. Backward Crossing 2nd-order Composition: 

(Y/Z)/VV :g XVY:f > (K/Z)/W :ÀwAzf (g wz) (<B2) 


(Again, the effect of these rules is to “cancel” Y.) We assume following SP that these 
rules are the only higher-order composition rules. (In particular, there are no such 
rules with mixed directionality in the secondary function.) 

This feature of the theory allows elegant capture of a Germanic control construc- 
tion that allows unboundedly many arguments to cross dependencies with their gov- 
erning verbs, as in Figure 3a,b. This was a phenomenon which allowed the first for- 
malproof that naturallanguages were not even weakly context-free (Huybregts, 1984; 
Shieber, 1985, data for Zurich German from Shieber). 

Some alternative orders to those in Figure 3a,b including the following are cor- 
rectly also allowed in CCG (Shieber, 1985): 


(34) a. Das mer em Hans hàlfed es huus aastriiche. 


b. Das mer d”chind lönd em Hans hàlfe es huus aastriiche. 


5 Coordination 


5.1 Right node raising 


Because the category S/NP of the domain ofrelativization does not distinguish the sta- 
tus of the argument /NP as extracted or lexical, we immediately predict the possibility 
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of unbounded rightward movement, as in the following derivation: 


(35) İHarry seesl and [Fred says he likes) Sally 
>B 


S/NP š; (XuX)/X S/NP NPÍ 
(S/NP)VS/NP) 
(S/NP) 
< 
S 


The category (XV X)/X : ÀpÀq.p ni q of the conjunction is restricted by x slash- 
typing to only combining by the application rules (8) rather than the more restricted 
composition rules (23). This condition imposes Ross” 1967 “Across the Board” (ATB) 
constraint (1967) on both rightward and leftward extraction out of coordinate struc- 
tures including the “same case” condition, as in the following examples. 


(36) a. the woman that İsees Harryİyyp and [likes Fred]s yp 
the woman that [Harry seesİş/yp and [Fred likesly xp 
. “the woman that[Harry seesİş/yp and [Fred likes herlş 
. “the woman that[Harry sees herlş and [Fred likes]ç;yp 


e° S o D 


. “the woman that[Harry sees]s/yp and [likes Fred]s xp 


The restriction on the conjunction category prevents application to S in one con- 
junct and composition into S on the other. This restriction should be seen as a con- 
sequence of the semantics, which is defined as Partee and Rooth”s transitive closure 
p nq over function types (1983), and must therefore apply to p and q of the same type. 

Constituents including extracted complement subjects can coordinate with those 
containing extracted objects: 


(37) amanthatwehad invited, and believed would come 


Under the CCG account of coordination proposed here, this fact again forces the as- 
sumption that they have the same category—in this case VP,,,,,/NP—with the subject- 
extracted ones differing only in being restricted to wh “antecedent government” via 
the subfect-extracting category (29). 

The same restriction means that right-node raising corresponding to (37) is 
blocked, in a rare exception to the symmetry of right- and left-extraction: 


(358) *We had invited, and believed would come, the man who broke the bank at 
Monte Carlo. 


5.2 Argument/Adiunct cluster coordination 


Less obviously, the assumption that all arguments are mopholexically cased, or type- 
raised, including accusatives and datives as vvellas nominatives predicts the following 
“Argument/Adiunct Cluster” coordination (Dovvty, 1988): 
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(39) Give Harry books and Sally records 
(VP/NP)/NP (VP/NP)X(VP/NP)/NP) VP(VP/NP) (XNXX)/X (VP/NP)V(VP/NP)/NP) VPVVP/NP) 
give : Ap.p harıy : App books : AqAp.p q : Ap:p sally : Ap.p records 
VPV(VP/NP)/NP) bə VPV(VP/NP)/NP) 5 
: Ap.p harry books : Ap.p sally records 


5 
Ar.r harry books Apr 


VPV(VP/NP) əv 
: Arir harry books A r sally records 


VP : give harry books A give sally records 


The argument cluster coordination construction (39) is an example of a univer- 
sal tendencey for “deletion under coordination” to respect basic word order: in all lan- 
guages, if arguments are on the left of the verb then argument clusters coordinate on 
the left, if arguments are to the right of the verb then argument clusters coordinate to 
the right of the verb, while SVO languages pattern with verb-initial (Ross, 1970): 


(40) SVO: “SO and SVO SVO and 0 
VSO: “SO and VSO VSOand SO 
SOV: SOand SOV “SOV and 50 


For example, all contiguous substrings of Shieber”s Zurich German examples in Fig- 
ure 3a,b are correctly predicted to coordinate with sequences of the same type (Steed- 
man, 1985). (Hovvever, there is more to say concerning the precise mechanism that 
allows verb-medial gapping in the SVO case, and vvhy it patterns with VSO—see Steed- 
man, 1990, 2000.) 

Such cluster coordinations vvere the motivation for Pesetsky”s (1995) postulation 
of a level of “cascade structure” as the domain of binding and coordination, distinct 
from “layered structure”, the domain of movement. In CCG, layered and cascade struc- 
ture correspond to the same single level of derivation structure. 


6 On intonation structure and the notion “surface 


constituent” in CCG 


Itis important to be clear at this point that CCG categories like S/NP are not equivalent 
to SLASH notations in GPSG/HPSG. m particular, in CCG, a category of the form X/NP 
does not denote a constituent of type X including a trace or gap of type NP. 

On the contrary, S/NP is in CCG a constituent type in its own right, free to either 
combine with a preposed or irsitu relativized element, or to combine with a full NP—in 
particular, one that is right node-raised across-the-board, as in (35). 


13 The accounts of right node-raising in Gazdar (1981) and Gazdar et al. (1985) both require a separate 
metarule to license RNR. 
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As a consequence, CCG generalizes the notion of constituency beyond the tradi- 
tional notion to include any sequence that is typable using combinatory rules, including 
Harry sees, of type S/NP, and even Harry books, which we saw in (39) can be typed as 
VPV(VP/NP)/NP). As a consequence, CCG also necessarily allows (many) alternative 
derivations or constituent structures for canonical sentences. For example, as vvell as 
the earlier standard derivation (14), it allows the following: 


(41) Harry sees Sally 


NPİ (SWP)V/NP NPİ 
: App harry :sees App sally 


5 
S/NP : Ax.seesx harry 
S: sees sally harry 


For longer sentences of length n there will be a number of alternative analysis up to 
the nth number in the more than exponentially rapidly-growing Catalan series, all 
yielding the same logical form. 

This proliferation of constituent structures is sometimes referred to as “spuri- 
ous ambiguity”. However, it should not be regarded as a weakness in the combina- 
tory theory of grammar. Many languages have freer word order than English, and 
do not support any clear notion of surface constituency. Even for English, there is 
no clear consensus on whether the surface structure of the ditransitive VP or the 
subject auxilliary-inverted question is flat and ternary-branching, left-branching, or 
right-branching (Adger, 2003, 122-131, and cf. Barss and Lasnik, 1986; Larson, 1988, 
Pesetsky, 1995, Jackendoff, 1990; Larson, 1990, passim). Nor is structural ambiguity 
a problem for performance or processing. It is just a fact of life (many other construc- 
tions, such as noun-noun compounding, yield Catalan-serial numbers of analyses). 
Parsers are good at dealing with other sources of ambiguity by the use of an “oracle” 
such as a statistical model, and they can do the same with this one. (It is worth re- 
membering that ambiguity is endemic in all natural languages, and that none of them 
shows the slightest sign of evolving in the direction of reducing their overall level 
of ambiguity—Labov, 1994, 40-—42, chs. 19,20, Croft, 2000, 68, 102-4, Newmeyer, 
2003, 694, passim.) 

CCG”s unorthodox notion of constituency is transparently reflected in prosodic 
structure. Thus the following intonation contours appropriately mark the two alterna- 
tive derivations for the transitive clause appropriate to the two context setting ques- 
tions Q: 


(42) Q: Iknow BARRY sees ALICE. But who sees SALLY? 
A: (HARRY) (sees SALLY) 
H* L+H* LH% 


NP! SUNP 
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(43) Q: Iknow BARRY sees ALICE. But who does HARRY see? 


A: ( HARRY sees ) (SALLY) 
L-H” LH% H” LL 06 
S/NP NP! 


The notation for intonation-phrasal tunes is from Pierrehumbert and Hirschberg 
(1990). Here, L+H* LH % marks topic or Theme, H* LL 06 marks comment or Rheme. 
Exchanging the Afnsvver)s to the Q(uestions) is highly unacceptable. Steedman (2014) 
develops a theory of intonation structure and its meaning using the Alternative- 
Semantic framevvork of Rooth (2016). 

As in the Match Theory of Intonation Structure of Selkirk (2011) and its earlier 
Edge-marking incarnation (1990), Theme/Rheme marking is profected onto phrasal 
constituents directly, by syntactic derivation alone, bounded by combination of the 
phrase with an edge-based boundary tone. However, no independent extra-syntactic 
mechanism of “Focus Projection” is needed to achieve the semantics of “broad focus”. 
Nor are any violable constraints needed to explain departures of intonation structure 
from syntactic derivational structure, for there arenosuch departures. In CCG, surface 
syntactic structure is simply identical to phonological form. 

Thus, the domain of the prosodic phrase @ is the same as the domain of wh- 
movement, a state which is aspired to in Minimalist Contiguity Theory (Richards, 
2016, Ch.3; Richards, 2016, 9). Prosodic structure is thereby defined as part of “Nar- 
row Syntax” in the sense of Chomsky (2001)."" 


7 Explanatory adequacy 


CCG”s combination of type-raising and composition subject to the CPP (12) yields a 
permuting and rebracketing calculus closely tuned to the needs of and constraints on 
natural grammar. 

CCG thereby reduces Minimalisms' MOVE/INTERNAL MERGE and COPY/DELETE, 
together with intonational phrasing, to contiguous EXTERNAL MERGE. 

Constraints on dependency projection, such as the *that-t and across-the-board 
conditions, arise from the nature of the lexicon and combinatorics of CCG rather 
than from additional constraints on syntactic derivation. Part of this explanatory 
force arises from the low expressive power of the combinatory rules under the CPP, to 
which we now turn. 


14 However, there are some important differences. For Richards, the possibility of in situ wh-elements 
depends on everything between coMP and wh forming a single prosodic phrase. Otherwise, wh- 
movement is forced. In the present terms, v/h-movement also is only possible if everything can be 
composed and thereby also become a contiguous intonational phrase. 
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7.1 Expressive power of CCG 


CCG and TAG are provably weakly equivalent to Linear Indexed Grammar (LIG) (Vijay- 
Shanker and Weir, 1994, Kuhlmann etal., 2015).” Both are therefore “Mildly Context 
Sensitive” under the definition of Joshi (1988) and Joshi etal. (1991), which (among 
other properties), in the latter case calls for non-permutation-completeness. 

In particular, of the n! possible permutations on n functional heads, CCG only al- 
low a proportion defined by the nth Large Schróder Number S(n). For example, for a 
“cartographic” right-branching spine of only 8 functional heads, nearly 80 % of the 8! 
permutations are excluded. 

This property was first noted by Williams (2003, 125) for his categorial system 
CAT. Williams (2003, 209) wrongly claimed that the inclusion of type-raising in CCG 
would allow all orders. However, Williams failed to note that, as we have seen, type- 
raising in CCG is morpho-lexical and defined over lexical functions over the original 
types, rather than a free syntactic operation. Type-raising changes the set of types 
involved, and therefore changes the “Basic Order of Merger”, defined by purely ap- 
plicative derivation. Nevertheless, any fixed set of types, including raised types, is as 
a consequence non-permutation-complete. 

Specifically, for a set of four categories of the form A|B, BİC, CİD, D, determining 
a basic order of merger (1,2,3,4), 22 out of the 24 possible permutations are allowed. 
The two that CCG excludes are the following: 


(44) a. 3142 
b. 2413 


An example ofa construction of this form is the nounphrase construction investi- 
gated by Cinque (2005) and Abels and Neeleman (2012), and mentioned in Section 1.5, 
for which the categories in English are the following: 


(45) 1: theseypi yy, 2: fivey /x 3: fatyın 4: catSy 


The prediction is that no language will require or allow orders corresponding to either 
of the following glosses:” 


(46) a. “fat these cats five 
b. *five cats these fat 


These two orders are indeed not listed among the fourteen orders that Cinque identi- 
fes as attested for the languages of the world, nor are they included among the nine- 


15 VVeak equivalence means that they admit the same stringsets, though notvia the same derivations. 
Kuhlmann etal. (2015) show that the specific slash-typing version of CCG presented in this chapter is 
actually slightly less expressive than TAG. 

16 In testing such predictions, Cinque (2005) points out that it is important to be sure in particular 
that adjectives like “fat” are functioning as such, rather than being extraposed. 
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teen orders that Nchare (2012) identiftles as occurring in the free vvord-order language 
Shupamem. 

If we renumber the original set 1, 2, 3, 4 as X, 1, 2, 3, then (44b) is the “13-X-2 
constraint on movement observed by Svenonius (2007) for adjuncts, an observation 
which led Svenonius to complex stipulations of strong features and null functional 
heads to limit movement in “roll-up” derivations such as pied-piping in Germanic. 
Thus, it seems likely that the orders in (44) are indeed universally excluded. In CCG 
this restriction is of the kind identifted in Aspects as a Formal Universal, stemming 
vvithout stipulation from the theory of grammar itself, 

This property of non-permutation-completeness puts CCG at a different level of 
the extended language hierarchy of “abstract families of languages” (Ginsburg and 
Greibach, 1967) than standard Minimalist theories. Michaelis (1998, 2001) shovved that 
Minimalist Grammars (MG) under the definition of Stabler (2011) and including the 
“Shortest Move” or Minimal Link Condition (MLC) on movement, are weakly equiva- 
lent to Linear Context-Free Rewriting Systems (LCFRS), or, equivalently, to Multiple 
Context-Free Grammars (MCFG). At the time, it was conjectured that LCFRS/MCFG 
were Mildiy Context Sensitive (MCS), from which it seemed to follow that MG--MLC was 
also MCS. However, it has subsequently been shown by Salvati (2011/2015) that the ar- 
tifıcial language MIX, consisting of all permutations on the string abc" is an MCFL. 

Under the (informal) definition of Joshi et al. (1991), mild context-sensitivity ex- 
plicitly excludes permutation-completeness. So, since MCFL includes MIX, a permu- 
tation-complete language by definition, MCFG is not MCS under that definition, and 
so neither is MG. On the other hand, under the formal definition of MCS given in 
Kallmeyer, 2010, 23-24, the MCS languages include MCFL. Salvati suggests that the 
languages characterized by the vvell-nested subset of MCFG, MCFGyyy might formally 
correspond to the set of MCS languages, and shows that TAG is weakly in MCFGyn, 
while Kanazavva and Salvati (2012) show that MIX is not in MCFLıuyy. However, the 
MCTFGs to which Stabler”s MG corresponds are known to be non-vvell-nested (Boston 
et al., 2010), (although Kanazavva et al., 2011 shov that the addition of a further Spec- 
ifler Island Contraint to MG restricts them to a subset of MCFGyyyl. 

By contrast, Kanazavva and Salvati (2012) also show that MIX is nota Tree Adioin- 
ing Language (TAL), and therefore not a Combinatory Categorial Language (CCL), so 
TAG and CCG remain mildiy context sensitive in a stronger sense, without constraints, 
under all definitions. 

m this rather confusing state of affairs, it therefore seems helpful to distinguish 
the latter, not merely as mildly context sensitive, but more narrowly as “Slightly Non- 
Context-Free” (SNCF). 


7.2 Child language acquisition 


The restriction of the CCG lexicon and combinatory rules to strict type-transparency 
betvveen constituents and their logical forms means thatit supports a practically com- 
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putable model of child language acquisition via “semantic bootstrapping” (Pinker, 
1979), or, more properly, learning given access to contextually accessible universal 
logical form. 

The problem of child language acquisition then reduces to the problem of learn- 
ing (a) a lexicon, and (b) a parsing model, for all rules consistent with the (noisy) 
language-specific data and the (ambiguously) contextually available meaning repre- 
sentations, of which those sound-meaning pairs belonging to the actual target gram- 
mar will be vastly more frequent than the spurious ones (Abend etal., 2017). 

mterestingiy, the learner of Abend etal. gives a superficial appearance oflearning 
parameters. (For example, in the later stages of learning English, the probability mass 
assigned to the SVO category for an unseen transitive verb vvill be near1, and all other 
categories will be near 0.) However, there is no learned parametfic prior directly asso- 
ciating this semantic type vvith this category. Instead, the information is implicit in the 
probabilities assigned to the various instantiated syntactic rules in the grammar asa 
whole that are used in calculating the prior probabilities of alternative derivations. 


8 Conclusion: Tovvards a combinatory minimalism 


CCG is a theory that embodies in a very direct form the Minimalist insight that syn- 
tactic derivation is determined by need to create objects that are phonologically and 
semantically vvell-formed, and nothing else, reducing unbounded movement to con- 
tiguous composition and case to morpholexical type-raising. 

Other bounded types of movement, including “raising” and “control” relations, 
“head movement” (Roberts, 2001), “scrambling” (Ross, 1967), and “sidevvard move- 
ment” (Nunes, 2001) are defined statically, in the lexicon, at the level of logical form, 
from which they are profected by syntactic derivation. 

In view of recent invocations among Minimalists of both prosodic contiguity and 
type-raising, there seems to be a possibility of extending the Minimalist Program to 
cover the full range of movement, coordination, and prosodic structural phenom- 
ena under the following equation, subject to the combinatory projection principle 
(CPP), (12): 

Minimalism = Categorial Grammar + Case + Composition 
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